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Abstract. We study the streaming complexity of the membership problem of l-turn-Dyck 2 and Dyck 2 
when there are a few errors in the input string. 

l-turn-Dyck 2 with errors: We prove that there exists a randomized one-pass algorithm that given x 
checks whether there exists a string x' £ l-turn-Dyck 2 such that x is obtained by flipping at most k 
locations of x' using: 

— O(fclogn) space, O(fclogn) randomness, and poly(klogn) time per item and with error at most 
l/n c . 

— 0(k 1+c + logn) space for every < e < 1, O(logn) randomness, (^((log ^ 1 ' n + k°^)) time per 
item, with error at most 1/8. 

Here, we also prove that any randomized one-pass algorithm that makes error at most k/n requires at 
least Q(k log(n/fc)) space to accept strings which are exactly fc-away from strings in l-turn-Dyck 2 and 
to reject strings which are exactly k + 2-away from strings in l-turn-Dyck 2 . Since l-turn-Dyck 2 and 
the Hamming Distance problem are closely related we also obtain new upper and lower bounds for 
this problem. 

Dyck 2 with errors: We prove that there exists a randomized one-pass algorithm that given x checks 
whether there exists a string x' £ Dyck 2 such that x is obtained from x' by changing (in some restricted 
manner) at most k positions using: 

— 0(fclog7i + \Jn log n) space, O(fclogn) randomness, poly(klogn) time per element and with error 
at most l/n c . 

— 0(k 1+c + Vnlog n) space for every < e < 1, O(logn) randomness, 0((log 0(1) n + k° w )) time 
per element, with error at most 1/8. 

1 Introduction 

The data streaming model was introduced in the seminal work of Alon et al. [3] . This model naturally arises 
in situations where the input data is massive and rereading the input bits is expensive. The main parameters 
that play a role in designing algorithms in such situations are: the space used by the algorithm, and the 
number of passes made over the input. An algorithm is said to be an efficient data streaming algorithm, if 
the space used by the algorithm is substantially lesser than the length of the input (sublinear in the length 
of the input) and the number of passes is independent of the length of the input. Many variants of this basic 
model have been studied. (See for example [15] for a survey.) 

The membership testing for well-paranthesises strings has been considered in the past. We denote the 
set of words with balanced parentheses of I different types by Dyck z . It is known that there is a O(logn) 
space deterministic algorithm for testing membership in Dyck;. (In fact the problem is known to be in TC° 
[6].) The problem has been considered from property testing perspective (see for example [2], [12] )• Recently, 
the problem was considered in the streaming model by Magniez et al. [14j . It was proved that there is 
a randomized one-pass streaming algorithm that takes space 0(\/n\ogn) and tests membership in Dyck ; . 
They also gave an efficient 0(log 2 n) space algorithm which makes bidirectional pass (one forward and one 
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backward pass) on the input. They also proved a lower bound of Q{*/n) for any randomized streaming 
algorithm that makes a single unidirectional (only forward) pass. Chakrabarti et al. [8] and Jain et al. [12] 
considered the lower bound problem for unidirectional multi-pass randomized algorithms. In [5] it was proved 
that any T-pass (all passes made in the same direction) randomized algorithm requires Q(^/n/T\og\ogn) 
space. Whereas [12] proved fl{y/n/T) space lower bound for the same. In [5] membership testing for other 
classes of languages was considered. In [3] it was proved that any randomized T pass algorithm (passes made 
in any direction) for testing membership in a deterministic context-free language requires ft(n/T) space. 

We consider a slightly general version of the membership testing problem for Dyck;. Let Si denote a 
set of I pairs of matching parentheses. We say that an opening parenthesis is corrupted if it is replaced by 
another opening parenthesis. Similarly, a closing parenthesis is corrupted if it is replaced by another closing 
parenthesis. For a language L £ , let A- k (L) be defined as the set of words over obtained by corrupting 
at most k indices of any word in L. In this paper, we consider the membership problem for Z\- fe (Dyck;) and 
Z\- fe (l-turn-Dyck;), where 

l-turn-Dyck 2 = {ww R \ w G {(, [}" n > 1} 

Here, W is the string obtained from w by replacing an opening parenthesis by its corresponding closing 
parenthesis and w R is the reverse of w. 

Accepting strings with at most k errors is a well-studied problem in many models of computation. In the 
streaming model, the problem has been studied in the past (see for example Cao et al. [7]). But we believe 
that the problem needs further investigation; this being the primary goal of this paper. 

We observe that the membership testing problem for zA-^Dyck;) (Z\- fe (l-turn-Dyck;)) reduces to the 
membership testing problem of Z\- fc (Dyck 2 ) (Z\- fc (l-turn-Dyck 2 ), respectively). We give a simple fingerprint- 
ing algorithm for Z\- fc (l-turn-Dyck 2 ) that uses O(fclogn) bits of space and randomness. The space require- 
ments of this algorithm are nearly optimal (because of a communication complexity lower bound of ) but 
the randomness requirements are not. We consider the question of derandomizing the above. The question 
of derandomizing streaming algorithms has been considered in the past (see for example [TD],[T5],[18 ,[20 ). 
We show that the algorithm can be modified to work with just O(logn) bits of randomness, incurring a 
small penalty in the amount of space used. We then consider similar questions for the more general problem 
Z\^ fe (Dyck 2 ). The following table summarizes our algorithmic results: 



Problem 


One-pass 
Algorithm 


Space 


Randomness 


Error 


Time (per element) 


l-turn-Dyck 2 


1 

2 


O(fclogn) 

for all < e < 1: 

0(k 1+e +logn) 


0(k logn) 
O(logn) 


inverse poly 
1/8 


poly(k log n) ^ 
0((logn)°« +fcO(i)) 


Dyck 2 


3 
4 


0(fclogn + ^/nlogn) 
for all < e < 1: 
0(fc 1+e + y/n\ogn) 


0(k logn) 
OQogn) 


inverse poly 
1/8 


poly{k\ogn) 4 
0((\ogn)°W +fcO(i)) 



In all the algorithms in the table above, we assume that the length of the input stream is known. 

Using Algorithm 1, we can deduce the number of errors as well as their locations. Using a combination of 
the algorithm for membership testing of l-turn-Dyck 2 due to [TJ] (which we refer to as MMN algorithm) and 
Algorithm 1, it is easy to get a membership testing algorithm for Z\- fc (Dyck 2 ). However, such an algorithm 
uses 0(k\/n\ogn) space. In order to achieve the claimed bound, we modify their algorithm for testing 
membership in Dyck 2 and use that in conjunction with Algorithm 1. In our algorithm, we do not need to 
store the partial evaluations of polynomials on the stack. 

Algorithms 2 and 4 are inspired by the communication complexity protocols of Yao |22] and Huang 
et al. [IT]. A mere combination of their ideas, however, is not enough to get the required bounds. The 
crucial observation here is that Yao's protocol can be derandomized by using a combination of small-bias 

4 In the case of Z\- fc (l-turn-Dyck ; ), this is the exact time per item. However, for Zi- fc (Dyck ; ) it is the time per item 
on average. In the latter case, the algorithm first reads a block and then uses O (poly (k logn)) time per element of 
the block. Therefore, the time per block is 0{poly(k\og n)^n/ log n). Both algorithms use an extra post-processing 
time of n k+0(1 K 
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distributions and distributions that fool DNF formulae. As this requires very few random bits, we get the 
desired derandomization. These algorithms are also better as compared to Algorithm 1 and 3 in terms of 
their time complexity. For Algorithm 2, we first prove that it suffices to give an efficient algorithm for Ham n k, 
where Ham rit k(x, y) for x, y £ {0, 1}™ is 1 if and only if the Hamming distance between x and y is at most k. 

Finally, we consider the question of optimality. We prove that any algorithm that makes k/n error requires 
S7(klog(n/k)) space to test membership in Z\- fe (l-turn-Dyck 2 ) by proving a lower bound on Ham„.fc. The 
two problems are related as follow: Let w £ S 2n and let w = uv where u 6 {(, [}" and v £ {),]}"■ If ( and ) 
are both mapped to and [ and ] are both mapped to 1 to obtain x, y from u, v then it is easy to see that 
uv £ Z\- fc (l-turn-Dyck 2 ) if and only if Ham n ^(x, y) = 1. 

The problem Ham^fc was considered in [22], [IT], in simultaneous message model. In [TT], a lower bound 
(in fact, a quantum lower bound) of Q(k) was proved for the problem. Their lower bound holds even for 
constant error protocols. To best of our knowledge no better lower bound is known for the problem. We 
improve on their lower bound by a log(n/fc) factor under the assumption that the communication protocol 
is allowed to make small error. Our lower bound can be stated as follows: 

Theorem 1. Given two strings x,y £ {0, 1}™ such that either the Hamming distance between x,y is exactly 
k or exactly k + 2, any randomized one-pass algorithm that makes error k/n requires space Q{k\og{n/k)) to 
decide which one of the two cases is true for the given x, y pair. 

For the lower bound, we use the result of Jayram et al. [JJ5]. Intuitively, the hardest case seems to be 
to distinguish between exactly k and exactly k + 2 errors. The main advantage of our lower bound proof is 
that it formalizes this intuition. Moreover, as our algorithm in Section 3 shows, this bound is tight up to a 
constant factor for n > k 2 (indeed, for n > k 1+e for any e > 0). This bound is not tight in all situations 
though, for example when but the error is constant. Also, it does not apply to multi-pass algorithms. 

However, this is better than the earlier bounds by a factor of log(n/fe) for small error. 

The rest of the paper is organized as follows: in the next section we give some basic definitions which will be 
used later in the paper. In Section[3]we give the two randomized one-pass algorithms for testing membership 
in Z\- fc (l-turn-Dyck 2 ). In [4] we discuss our results regarding testing membership in Z\- fc (Dyck 2 ). Our lower 
bound result is presented in Section [5] 



2 Definitions and Preliminaries 



2.1 £-wise independent hash families 

Definition 1. Given positive integers £, n, m, and s, a function F : {0, II s — > [m] n is an £-wise independent 
hash family if given any distinct 11,1%, ... £ [n] and any (not necessarily distinct) ji, j2, ■ ■ ■ , je S [ m ]j w £ 
have 

Pr [F(r)(h) = h A F(r)(i 2 ) = j 2 A ••• A F(r)(i*) - jt] = —„ 
re{o,i} s m 

where F(r) is interpreted as a function mapping [n] to [m] in the obvious way. 

Lemma 1. \21]j For any £,n,m, there is an i-wise independent hash family F : {0, 1} S — > [m] n , with 

s = 0(£log(n + m)) with the property that there is a deterministic algorithm which, on input r S {0, 1} S and 
i G [n], computes F(r)(i) in time poly(s) using space O(s). 



2.2 Some pseudorandom distributions for restricted tests 

Given m G N, we will denote by U m the uniform distribution on {0, l} m . 

Definition 2. Given any class T of boolean functions defined on {0, l} m , distributions D\, D 2 over {0, 1} T 
and 5 £ [0, 1], we say that D\ 5-fools T w.r.t. D 2 if for all f £ T ' , we have 



Pr [f(z) = 1] - Pr [f(z) = i; 



< 8 
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The above concept has been widely studied for many classes of functions, especially in the case when D 2 
is the uniform distribution U m . When £> 2 is the uniform distribution, in many cases, we know of distributions 
Di with very small support that are nonetheless able to fool some interesting class of functions J- w.r.t. Z?2- 
We note two such results below, since we will need them later. 

Definition 3. Given two vectors x, y € {0, l} m , denote by (x, y) the F 2 -inner product between x and y: that 
is, (x, y) — @j XiDi. For w G {0, define the function L w : {0, l}" 1 — > {0, 1} as follows: L w (x) — (w, x). 
The class of Linear tests on {0, l} m is defined to be the class of functions {L w \ w € {0, l} m }- 

Lemma 2 (Small bias spaces). J7|/ Given any 8 £ K >0 and m £ N, there exists an explicit function G\ : 
{0, 1} S — > {0, 1}™ for s — 0(\og(m/S)) such that the distribution G\(r) for a randomly chosen r £ {0, 1} S 8- 
fools the class of Linear Tests w.r.t. the uniform distribution lA m . Moreover, there is a deterministic algorithm 
A that, given r € {0, 1} S and i £ [m], computes the ith bit of G\(r) in time poly(s) and space 0{s). 

The existence of the algorithm A as stated in Lemma [5] is not formally stated in [T] but easily follows 
from Construction 3 of such spaces in the paper. We call a G\ as described above a 8-biased space over 
{0, 1}™. The distribution G\{r) for a randomly chosen r is said to be a 8-biased distribution. 

The second class of tests we will need to fool is the class of read-once DNF formulae over {0, l} m . It 
has been proved recently that <5'-biased distributions for small enough 8' can be used to <5-fool the class of 
read-once DNFs w.r.t. the uniform distribution. 

Lemma 3 (Fooling read-once DNFs). Given any 8 £ R >0 and m £ N, any 8' -biased distribution 8- 
fools the class of read- once DNFs over {0, l} m w.r.t. the uniform distribution U m , as long as 8' < m0(l0 g (1/l5)) . 

By Lemmas and 02 we have 

Corollary 1. Given any 8 £ K >0 and m€N, there exists an explicit function G2 ■ {0, 1} S — > {0, l} m for 
s = 0(logmlog(l/(5)) such that the distribution G2(r) for a randomly chosen r £ {0, 1} S 8-fools the class 
of Linear Tests w.r.t. the uniform distribution U m . Moreover, there is a deterministic algorithm that, given 
r £ {0, 1} S and i £ [m], computes the ith bit of G2(r) in time poly(s) and space O(s). 



3 Equivalence with errors 

In this section, we consider the problem of testing membership in Z\- fe (l-turn-Dyck ; ). Magniez et al. |14) . 
showed that it suffices to design efficient streaming algorithms for testing membership in Dyck 2 in order to 
get efficient streaming algorithms for testing membership in Dyck ; . Formally, 

Lemma 4 ( [14j ) . If there is a one-pass streaming algorithm for testing membership in Dyck 2 that uses space 
s(n) for inputs of length n, then there is a one-pass streaming algorithm for testing membership in Dyck ; 
that uses space 0(s(nlogl)) for inputs of length n. 

We first prove a lemma similar to Lemma 131 to state that it suffices to design an efficient streaming 
algorithm for Z\- fe (I-turn-Dyck 2 ) (Z\- fc (l-turn-Dyck 2 )) in order to get an efficient streaming algorithms for 
Z\^ fc (I-turn-Dyck;) (respectively, Z\- fe (Dyck;)). 

Lemma 5. // there is a one-pass streaming algorithm for testing membership in A- 2k (l-turn-Dyck 2 ) 
(A- 2k (Dyck 2 )) that uses space s(n) for inputs of length n, then there is a streaming algorithm for test- 
ing membership in Z\- fc (l-iwn-Dyck;) (Z\- fc (Dyck ; ) y ) that uses space 0(s(nl)) for inputs of length n. 

Proof. We use a distance preserving code for this. We encode an opening parenthesis of type (i by ( l_1 [ ('~\ 
And we encode a closing parenthesis of type )i by ] Now given a string w £ Sf, the new string w' 
is over the alphabet S — {(, [,),]}. And \w'\ — nl. Also for every mis-match in w, w' has two mis-matches. 
Thus the lemma. □ 
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Let Z?- fc (l-turn-Dyck 2 ) be the set of string obtained by changing at most k symbols of words in 
l-turn-Dyck 2 . Assuming that the length of the string is known, the membership testing for D- fc (l-turn-Dyck 2 ) 
(which is more general than Z\- fe (l-turn-Dyck 2 )) can also be handled by the techniques introduced in the 
paper. If the input string has opening parenthesis in the first half of the string, then it is considered to be 
an error. It is easy to keep track of such errors. 

We now note that Z\- fe (l-turn-Dyck 2 ) on inputs of length n reduces to the problem Ham n / 2k . 

Lemma 6. There is a deterministic one-pass streaming algorithm that uses space O(logn) and time O(n), 
which given as input a string w £ {(, [,),]}", outputs a pair of strings x, y £ {0, l}™/ 2 and either accepts 
or rejects. If the algorithm rejects, we have w £ A- k (l-turn-Dyck 2 ) . Otherwise, we have A{x,y R ) < k iff 
w £ A- k (l-turn-Dyck 2 ) . 

Proof. Given an input of length n, the algorithm scans its input from left to right, and outputs on seeing 
"(" and 1 on seeing "[" in the first n/2 symbols of its input u>; similarly, on the second half of w, the algorithm 
outputs on seeing ")" and 1 on seeing "]". The algorithm rejects either if it sees the closing braces in the 
first half of its input or the opening braces in the second half of its input (in this case, an opening brace has 
been corrupted by a closing brace or vice versa) and accepts otherwise. If the algorithm accepts, we see that 
A(x,y R ) is exactly the distance of the input from a string in l-turn-Dyck 2 . The lemma follows. □ 

The above lemma shows that it suffices to come up with a streaming algorithm for the Hamming distance 
problem to solve the problem Z\- fc (l-turn-Dyck 2 ). Once we have such an algorithm, we simply run the above 
reduction on an input w £ {(,[,), ]}", and obtain strings x, y R , which we feed in as input to the algorithm for 
Ham„/ 2jfe (of course, if the reduction rejects, we reject the input). Though Ham,,^ is only a minor restatement 
of zl- (l-tum-Dyck 2 ), we prefer to work with this problem because of its cleaner definition. 

Theorem 2. For any k and any constant c > 0, there is a one-pass randomized streaming algorithm which, 
when given as input strings (x,y R ) £ {0,1}" x {0,1}™, that accepts with probability 1 if A(x,y) < k and 
rejects with probability 1 — l/n c if A(x,y) > k. The algorithm also detects the locations where x and y 
differ with probability at least 1 — l/n c if A(x,y) < k. The algorithm uses O(klogn) space and 0(k\ogn) 
randomness. The time required by the algorithm is poly(klogn) per item plus n k+ °^ for post-processing. 

Proof. The algorithm uses a fingerprinting strategy and is directly inspired by the standard randomized 
communication complexity protocol for the Equality problem (see |13j . for example). Fix a field F 2 «, where 
the exact value of I will be determined later. We call a polynomial p[z) £ ¥ 2 t [z] boolean if all of its coefficients 
are or 1. The weight of a boolean polynomial p will be the number of non-zero coefficients of p. 

We think of w £ {0, 1}™ as defining a boolean polynomial p w (z) £ ¥ 2 i [z] as follows: p w (z) — Y2%=i w^ 1-1 ; 
where Wi denotes the zth bit of w. Note that the polynomial q x , y (z) := p x (z) + p y (z) is a boolean polynomial 
of weight exactly A(x,y). We check that A(x,y) < k by evaluating q xs (z) at a random a £ F 2 ?. More 
formally, the algorithm is: 

— Pick a £ ¥ 2 e uniformly at random. 

— Check if q x ^ y (a) = p(a) for any boolean polynomial p of degree less than n and weight at most k. If not, 
REJECT. " 

— If the above does hold for some boolean polynomial of weight at most k, ACCEPT and pick any such 
polynomial p(z) = ^YlnPiZ 1 . Let S = {i \ pi ^ 0} be the support of p. Output S as the estimate of 
points where x and y differ. 

Let us first establish the correctness of the above algorithm (assuming I is large enough). Clearly, if 
A(x,y) < k, then q x y (z) is a polynomial of weight at most k and the algorithm always accepts. The 
algorithm can only err if: (a) A(x,y) > k or (b) A(x,y) < k but the algorithm outputs the wrong set of 
indices as its estimate of where x and y differ. In either case, there is a boolean polynomial p(z) of degree 
less than n and weight at most k such that q XjV (z) ^ p(z) but q x ^ y (a) = p(a). For any fixed polynomial p{z), 
this happens with probability at most n/2 e by the Schwartz-Zippel Lemma. Since the number of polynomials 
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of weight at most k is at most n k , the probability that there exists any such polynomial p is bounded by 
n k+1 /2 i . Choosing t — O(klogn), we can reduce this error to l/n c as claimed. 

Computing q XjV (a) can easily be done in a one-pass fashion using space 0(1) = O(fclogn) and time 
polyik log n) per item. After reading the stream, we need to cycle through the n k boolean polynomials p 
of weight at most k and compute the values they take at input a £ F 2 * , which can also be done in space 
0(fclogn + ^) = O(klogn) and time n fc poly(fc logn) = n k+0 ^\ as claimed above. This completes the proof 
of the theorem. □ 

3.1 A randomness-efficient streaming algorithm for Hamming Distance 

Above, we showed that Ham^ can be computed using space 0(k\ogn) and O(klogn) random bits. Are 
these parameters optimal? As we will show later in Section^ the bound on space is nearly optimal. However, 
we show in this section that the number of random bits can be significantly reduced, if one is willing to use 
a small amount of additional space. The ideas in this section go back to the results of Yao [22] and Huang et 
al. who designed efficient randomized communication complexity protocols for the two-party problem 
of checking if the Hamming Distance between x and y is at most k. 

Let PHarrin^; : {0, 1}" x {0, 1}™ — > {0, 1} be a partial function, defined as follows: On input (x,y R ) it 
evaluates to if the hamming distance between x and y is greater than or equal to Z, it evaluates to I if the 
distance is less than or equal to k and is not defined on other inputs. 

Theorem 3. For every constant < e < 1 there is a randomized one-pass streaming algorithm that computes 
Ham„.fc using 0(k 1+e +logn) space and O(logn) randomness and errs with probability bounded by 1/8. The 
time taken by the algorithm is 0(\ogn) ^ + fc ^ 1 )) per item. 

Proof Strategy: In order to prove the above theorem, we divide the problem into two parts. Assuming 
there are at most 2k errors, we design an algorithm that computes Ham„ ^ correctly with high probability. 
We call this the inner algorithm. We design another randomized algorithm to compute PHam„ ^,2/0 with high 
probability. We call this algorithm the outer algorithm. 

We output 1, that is we declare that the number of indices where x and y differ is at most k, if and only 
if both the inner and the outer algorithms output I. If x and y differ on more than 2k indices, then the outer 
algorithm will output with high probability. The answer of the inner algorithm will not be reliable in this 
case. Where as if they differ on more than k but less than 2k places then the inner algorithm will output 
with high probability. Let "fi , 72 be errors in inner and outer algorithms respectively. Then the overall error 
7 is bounded by "fi +72- We prove that both "fi and 72 are bounded by 7/2 for a small constant 7. 

Inner algorithm 

Definition 4. Given, k,n £ N, we say that an element w £ ([k] x {0,1})™ is an XOR representation of 
length n of a string a £ {0, l} k if for each j £ [k], we have aj — ©^.^—y u a u i- 

We think of the XOR representation as streaming updates of a structure over Fa. 

Lemma 7. There is a randomized one-pass streaming algorithm which given input x, y R £ {0, 1}™ such that 
A(x,y) < 2k computes an XOR representation of length n of a,b £ {0,f} 16fe I 1 such that with probability 
1 — 7/4, Ham n .fe(x,y) = Ham 16 j.2^ 7jfc (a,!)) The algorithm uses O(logn) bits of randomness, O(logn) space, 
and (\og(n/j))°^ time per item. 

Proof. The proof is simple. We pick a random hash function h from a pairwise independent hash family of 
functions mapping [n] to [I6fc 2 /7]. We think of h as dividing the n indices of x and y into 16fc 2 /7 buckets. 

Given x,y such that A(x,y) < 2k, call index i good if Xi ^= yi. Given two good indices i ^ j, the 
probability that h maps both of them to the same bucket is at most 7/I6fc 2 . A simple union bound tells us 
that with probability I — 7/4, all the good indices are mapped to different buckets. 
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After having picked h, the streaming algorithm computes the XOR representations of a, b defined as 
follows: for any j, cij is the XOR of the bits of x whose indices are in the jth bucket; formally, <Zj — 
®i-h(i)=j Xi > ^ ne string b is similarly related to y. Clearly, if h maps the good indices to different buckets, 
then a,j ^ bj iff the jth bucket contains a good index and hence A(a, b) = A(x, y). On reading the input bit 
Xi, the algorithm computes the bucket j — h(i) and writes down (j,Xi) which in effect updates the jth bit 
of a. In a similar way, when scanning y, the algorithm updates b. 

The space and randomness requirements are easily analyzed. Picking a random hash function h from 
a pairwisc independent family as above requires 0(max{logn, log(fc 2 /7)}) = 0(log(n/7)) random bits by 
Lemma [T] The algorithm needs to store these random bits only. Computing h(j) for any j only requires 
space 0(logn/7). Finally, the processing time per element is O (poly (log (n/7)). □ 

We will use the above algorithm as a streaming reduction and solve the problem using the algorithm of 
Lemma [H 

Lemma 8. For any n,k and every constant < e < 1 and 7 > fc0 1 (1) , there is a randomized one-pass 
streaming algorithm which, on inputs a,b £ {0, l} 16fc h accepts iff A(a,b) < k with error probability at most 
j. The algorithm uses O(logfc) bits of randomness, 0(k 1+e +logn) space, and time per element is k°^ . 
The algorithm expects its inputs a,b to be given in terms of XOR representations of length n. 

Proof. First, we present the algorithm for the special case when n = k and the input is simply the pair of 
strings a, b in the natural order of increasing indices. We will then explain the simple modifications that are 
necessary for the case when the input is a pair of XOR representations of length n. 

Fix a positive constant S < e. Let h : [16fc 2 /7] — > [k 1+s ] be a function picked at random. Let j £ [k 1+s ] 
be a fixed bucket. We have Pi[h(i) = j] = j^w- 

Define a set / of indices as follows: if A(a, b) < k, then let I be the indices where a and b differ; otherwise, 
let / be any set of k + 1 indices where a and b differ. Let u be the size of a subset U of /. We have 

Pr[h(U) = j] < jprxy- By a union bound over U of size u, Pr[3f7 : h(U) = j] < ^1+6^ < ^rays < ^72 ■ 
Therefore, since there are at most k 2 buckets, Pr[3U 3 a bucket j : h(U) = j] < l/k Su / 2 ~ 2 . 

We want this probability to be less than 7/8. Therefore we select u = 0(1/6 + l -^L/jl) = 0(1), where 

the last equality uses 7 > fc0 1 (1 ) and S is a constant. 

Note that the above argument works if we used a function from a it-wise independent family of functions 
rather than a random function. This requires only 0(u\og(k/"f)) = O(logfc) bits of randomness and space 
0(log(fc/7)) by Lemma[TJ Hereafter, we assume that we have picked a hash function h from this family so 
that each bucket j £ [k 1+s ] contains at most u indices from /. Let Bj and be the buckets formed by 
hashing a and b respectively, where 1 < j < k 1+s . 

Given boolean strings a', b' of the same length, define A u (a! b 1 ) to be min{Z\(a', b'), u}. We will compute 
the function F(a, b) — A u (Bj,Bj) and accept if the value computed is at most k. It can easily be 

seen, using the properties of h, that this computes Ham 16fc 2 / 7jfc (a, 6). 

Computing A u (Bj, Bj) for any j is easily done using the ideas of the algorithm of Theorem[2j We work 
over the field F 2 « where £ is a parameter that we will fix shortly. Given a polynomial p £ F 2 f [z] with only 
0-1 coefficients, we denote by the weight of p the number of non-zero coefficients of p. Given c £ {a, b} 
and j £ [k 1+s ], the bucket Bj defines for us the polynomial Pj. c (z) = Y^ieh- 1 ^) CiZ% over ~^ 2t ' -Define the 
polynomial qj(z) = Pj ia (z)+pj j b(z). The weight of qj is exactly A(B", Bj). The algorithm to compute F(a, b) 
is the following: 

1. Pick a £ V 2 t uniformly at random. 

2. For each j, compute qj(ct) and check if it evaluates to the same value as some polynomial p of weight at 
most u. If so, let Wj be the weight of an arbitrary such p; if not, let Wj = u. 

3. Output J2j w j- 

The above algorithm errs on bucket j only if there is a polynomial p 7^ qj of weight at most u such 
that p(a) = qj(a). This occurs with probability at most O(-^j) for a fixed polynomial p and hence with 
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probability at most ' 2g for some polynomial p of weight at most u after a union bound. After taking a 

union bound over buckets, we get a failure probability of at most ^-0$ (with different constants in the 

exponents). Choosing £ = 0(w log(-)) = O(logfc), we can reduce the error to 7/8. 

The overall error of the algorithm is at most 7/L The space used per bucket and the number of random 
bits used is at most 0(logk). Adding up over all buckets, the space used is bounded by 0(k 1+s log A;), which 
is at most 0(k 1+t ). The time taken by the algorithm to compute the values {qj(ct) \ j £ [k 1+s ]} is k°^\ 
Finally, checking if each qj (a) evaluates to the same value as a polynomial of weight at most u takes time 

k O(u) = k O(l) i 

Now for the case when the input is given as a pair of XOR representations of length n. We simply 
note that the polynomials pj^ a and p^b are still easy to compute. For example, on reading the ith element 
Wi — (ji,Ui) of the XOR representation of a, the algorithm simply updates the current value of Ph(ji),a{&) 
by adding UiU Ji to it; this works as intended since F 2 « is a field of characteristic 2. The algorithm only needs 
an additional counter that counts up to n so that it knows when the XOR representation of a ends. □ 

Setting 7 to be 1/8 in the Lemmas [7] and [51 we see that the space taken by the Inner algorithm overall is 
0(fc 1+e + logn), the amount of randomness used is O(logn) and the time taken per item is 0((logn) ^ + 



Outer algorithm Given x,y £ {0,1}™, we denote by {x,y} the F 2 -inner product of x and y. Formally, 
(x,y) = ®i =1 Xiyi. 

Lemma 9. There is a randomized one-pass streaming algorithm that computes PHarrin^fc correctly 
with probability 1 — 7/2 using 0(lognlog(l/7)) bits of space and randomness and time per item 
(logn)°( 1 )log(l/7). 

Proof. For simplicity, we will assume that k is a power of 2. All the results carry through in the general case, 
with only superficial changes. 

We use a protocol of Yao [22] . Yao devised a one-wajH randomized communication complexity protocol 
using which two players Alice and Bob, given inputs x and y respectively, can decide P\-\am n ^,2k(x, y) using 
(9(log(l/7)) bits of communication. A brief sketch follows. Let u denote x®y- Using public randomness, Alice 
and Bob pick random strings zi,...,ze £ {0, 1}™ for I = 0(log(l/7)) such that each bit of each Zj is set to 
1 independently with probability 1/Ak — we call this distribution Z?i/4fc. Alice computes (x, Zi), . . . , (x, z&) 
and sends them to Bob who uses them to compute (u, z±), . . . , (u, zi). 

Let Zi be picked from -Di/4&. It is easily checked that if A{x,y) < k, then (u, Zi) takes value 1 with 
probability at most p\ = ^ 1 /^^° k ^ m On the other hand, if A(x,y) > 2k, then (u, Zi) takes value 1 with 
probability at least P2 — ( 1 ~ 1 / e +° k ( 1 ^ . Thus, by performing a suitable threshold on the number of i such 
that (u, Zi) — 1 — say by checking if the number of Is is at least 2i±E2. — B b can compute PHarrin^^fc^j y) 
correctly with error probability at most S . 

Using the above ideas, we wish to come up with a streaming algorithm for this problem that is also 
randomness efficient. Both these constraints require us to change the original protocol. (Note that the 
obvious implementation of the above protocol in the streaming setting will require f2(n£) bits of space and 
randomness.) To reduce the amount of randomness, we run Yao's protocol with pseudorandom z from a 
distribution D that fools the linear test defined by the string u (see Section [2] for the definition of a "linear 
test"). Formally, we want a distribution D over {0, 1}™ such that for any w £ {0, l} n 



Pr [{w, z) = 1] — Pr \{w,z) 

z~D ' z~D 1 / 4k 



< 1/100 



That is, we want a distribution D that 1/100-fools linear tests w.r.t. Di/^. Furthermore, we would like 
to be able to sample from D using a small number of random bits. 



5 Actually, Yao's protocol works in the more restrictive simultaneous message model, but this fact is not relevant 
here. 
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Fix w £ {0, 1}™. Let f w be the following related test, denned on {0, l} r 



fw(Zn, Z 12 , . . . , Z u , Z 21 , Z 22 , • ■ • , Z 2tl . • . , Z n i, Z n2 , . . . , Z nt ) — ( £y 



=1 j=l 



where 2* = 4fc. Note that Pi ze rj 1/4k [(i 



= 1] = Pr 



[fw{z') — 1], where £/ r denotes the uniform 



distribution on {0, 1}'". 

We will first design a distribution D' over {0, 1}"* that 1/100-fools the family of tests {f w \ w £ {0, 1}"} 
w.r.t. the uniform distribution U n t- 

Now, we describe D'. We break the variables {zL} into two blocks: B\ := {z' n \ i £ [n]} and B 2 '■= {z^ \ i £ 
[n],j 7^ 1}. Consider the test f w evaluated at a random input z' £ {0, 1}"*. It is helpful to view this evaluation 
as a two-step process: In the first step, we substitute a uniform random string z 2 for the tuple of variables in 
B 2 . After this substitution, f w (-, z' 2 ) becomes a linear function on the variables in B±. If this linear function 
is the zero linear function, then f w (z[, z 2 ) cannot evaluate to 1 on any setting z[ of the variables in B^. On 
the other hand, if this linear function is non-zero, then f w (z[, z' 2 ) evaluates to 1 with probability exactly 1/2 
over the choice of z[. Putting things together, we see that Pr z >[f w (z') = 1] = (1/2) Pr 2 / [g w (z' 2 ) = 1], where 
g w is the following read-once DNF formula on the variables in B 2 that tells us exactly when /„, becomes a 
non-zero linear function on the variables in B\\ g w — Vi-io 4 =i Aj=2 z 'ij 

This tells us that we only need to fool read-once DNFs and linear tests w.r.t. the uniform distribution 
to fool /^,-tests w.r.t. the uniform distribution. We will generate z' £ D' as follows: z[ will be sampled from 
an explicit i5i-biased space D' x and z 2 will be independently sampled from an explicit space D' 2 that ^-fools 
read once DNFs. We have: 



Pi ■[/„(*') = 1] - Pr [/ w (z') = i; 

Z ' t*-> D Z ^Unt 



Pr [g w {z 2 ) = 1] • Pv [f w (z[,z 2 ) = 1 | g w {z' 2 ) = 1] - 

< 

<5i + 5 2 



Pr 



{t -i)[9w{z 2 ) = 1] 



Pi .[g w (z' 2 ) = l]- Pr [g w (z' 2 ) = l] 

~V 2 z 2 ~W„ (t _i) 



Pi .[f w (z[,z 2 ) = l | g w (z' 2 ) = l}-~ 

1 ~-Dt z 



where the first inequality uses the fact that \pq— rs\ < \p— r\ + \q— s\ for any p, q,r, s £ [0, 1], and the second 
inequality follows from the definitions of D' Y and D' 2 . Choosing 8\ and S 2 to be small enough constants, we 
obtain a pseudorandom distribution D' that 1/100-fools /^-tests w.r.t. the uniform distribution. By Lemma 
[2] and Corollary [1] the amount of randomness required for the above is O(logn). Using D' , we can define a 
distribution D that fools linear tests w.r.t. Di/ ik as follows: to pick z ~ D, we pick z' ~ D' and output z 
defined by Zi — A* =1 z^- for each i. It is easily seen from the definition of the tests f w that the distribution D 
1/100-fools all linear tests w.r.t. D 1 / 4k . Since no additional randomness is used to generate D, the amount 
of randomness used is O(logn). Moreover, by Lemma [2] and Corollary [IJ given a random seed r of length 
O(logn) and j £ [n], the jth bit of the output of D on this random seed can be generated using O(logn) 
bits of space in time poly(logrt). 

With the pseudorandom distribution D in place, we can run Yao's protocol in the streaming setting with 
independent random strings z\,...,zt picked from the distribution D. Exactly as above, for suitably chosen 
I = 0(log(l/7)), this algorithm computes P\-\am n ,k,2k(x,y) with error probability at most 7/2. The space 
and randomness used are both 0(lognlog I/7) and the time taken is npoly(logn) log(l/7). Setting 7 = 1/8, 
this proves the lemma and also concludes the proof of Theorem |3l □ 



4 Accepting Dyck 2 with errors 



In this section we consider the membership problem of A- (Dyck 2 ). We assume that disregarding the type 
of the brackets the string is well-matched. We only consider the kind of errors where an opening(closing) 
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parenthesis of one type is replaced by an opening(closing, respectively) parenthesis of the other type. We 
prove the following theorem: 

Theorem 4. For any k there exists a constant c > 0, there is a randomized one-pass algorithm such that, 
given a string w € S n , if w € Z\- fc (Dyck 2 ) then with probability at least 1 — l/n c it accepts w and if 
w (jL Z\- fc (Dyck 2 ), then with probability 1 — l/n c it rejects w. The algorithm uses Oik log n + \J n log n) space 
and takes poly(klogn) time per item and n k+0 ^ time for post-processing. 

It is easy to see that combining the ideas from |14j and from the previous section, we can accept 
Z\- fe (Dyck 2 ) using 0(ky / n\ogn) space. But for the bound stated in Theorem 01 we need more work. 

In [14] a one- pass randomized streaming algorithm was given for the membership testing of Dyck 2 . We 
refer to that as the MMN algorithm. We make one change to the MMN algorithm. We use the stack only to 
store indices from the set [n], and do not store the partial evaluations of a polynomial on the stack. 

Divide the input into Wn/ logn blocks of length y/nlog n each. In each block, check for balanced paren- 
theses and if there are less than or equal to k — Err mis-matches, then reduce the remaining string to a 
string (possibly empty) of closing parentheses followed by a string (possibly empty) of opening parentheses. 
Here, Err is a counter that maintains the number of mismatches found so far. If Err exceeds k, then halt 
the algorithm and reject. 

Let x denote the reduced string obtained by matching parentheses within each block. (Note that this can 
be done in a streaming fashion.) For the reduced string x we say that the opening parenthesis at position i 
has an index h Xy i if it is the /ith opening parenthesis in x. We say that the closing parenthesis has index h Xt i 
if it is the the closing parenthesis that closes an opening parenthesis having index h Xt i in x. We drop x when 
it is clear from the context. 

Observation 1 Note that no two opening (or two closing) parentheses have the same index. 

Also, an opening parenthesis has the same index as another closing parenthesis if and only if they form 
a matching pair in the string obtained from the input string by disregarding the type of the parentheses. 

For example in the input (([])[]) the indices of the opening parentheses would be ( 1 ( 2 [ 3 ])[ 4 ]) and the indices 
of the opening and closing parentheses would be ( 1 ( 2 [ 3 ] 3 ) 2 [ 4 ] 4 ) 1 - If vie reorder the input such that all opening 
parentheses are in the first half with ascending index and the closing parentheses are in the second half with 
descending input we get ( 1 ( 2 [ 3 [ 4 ] 4 ] 3 ) 2 ) 1 - 

We now describe the procedure for computing the index. The index of an opening parenthesis is easy to 
compute. It is a monotonically increasing quantity and can be stored in a O(logn) bit counter say c open . It 
is initialized to and incremented by 1 every time an opening parenthesis is encountered. To compute the 
index of a closing parenthesis we use the stack. The stack is only being used to compute the index. At any 
stage during the algorithm we maintain the intervals of yet to be matched open parentheses on the stack. 

The first block consists of only opening parentheses (assuming we have already reduced the block). After 
processing the first block we push [1, c open ] on the stack to remember the interval of indices yet to be matched. 
(In the case that all parentheses of the first block are matched within the first block, the next block is treated 
as the first block). 

Now suppose we process the next block (also assume that this block is already reduced). A reduced 
block will consist of a sequence of closing parentheses (possible empty) followed by a sequence of opening 
parentheses (possible empty). Recall we maintain the intervals yet to be matched on the stack, say the 
stack-top is [m,m'\. 

If the block begins with a (non-empty) string of closing parentheses, m! is the index of the first closing 
parenthesis, and is decremented after reading the closing parenthesis. As long as m! > m, this continues. If 
m! < m we get the next interval of unmatched parenthesis from the stack. (If the stack is empty the input 
is not well matched disregarding the type of parentheses and we reject the input). If the string of closing 
parentheses ends while m > ml , we push the remaining interval to the stack. 

When reading the string of opening parentheses we let m = c open at the beginning, process all opening 
parentheses, and before moving on to the next block we push [m, c open ] (where c open is updated value after 
reading the string of opening parentheses). 
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While processing closing parentheses, the stack is, if at all, popped but never pushed. While processing 
opening parentheses, a new stack element may be pushed. However, this happens at most once per block, and 
therefore at most yj nj logro times. Also the elements of the stack are tuples of indices so they are O(logn) 
bits. Therefore, the total space needed to compute indices is 0(log n^/nj log n) = 0(\/n\ogn). 

So we can compute the index of the parentheses, and now we show how to use this to compute Z\- fc (Dyck 2 ). 
Now assume that, at any stage i, we have the index of the input symbol x%. Let the sign of the opening 
parentheses be +1 and that of closing parentheses be —1. We think of the reduced string i € {(, [,),]}* as a 
string over {0, 1}* by replacing every occurrence of '(' and ')' by a and every occurrence of '[' and ']' by a 1. 
We think of this string defining a Boolean polynomial p x (z) — J2i sign(xi)xiZ lndc * of Xi . Due to Observation 
[U it is easy to see that the weight of the polynomial p x is at most k — Err if and only if w £ A-(Dyck 2 ). 
We now check whether w £ Z\- fc (Dyck 2 ) by evaluating p x at a random a £ F 2 i . Assuming that we know how 
to compute index of xi at step i, we can evaluate p x as in the proof of Theorem O 

Given below is the algorithm that uses the index finding procedure as described above, and evaluates 
polynomial p x at a random location to test membership of w in Z\- fe (Dyck 2 ). In addition to th space required 
for computing the indices, 0(1) bits are required to store evaluation of p x . But this does not need to be stored 
multiple times on the stack. Therefore, the algorithm uses 0(1 + y/n log n) = 0(k\ogn + \fn log n) space. 

The proof of correctness and the error analysis are similar to the proof of Theorem [2j Thus we get 
Theorem 0J The detailed algorithm is given below. 



1: pick a £ R F p , set sum <— 0, set c open <— 0, set Err <— 0, 

2: for each block do 

3: read the word y consisting of the next \J n log n letters (or less if the stream becomes empty), 

4: check that matching pairs within y have at most k—Err errors (if not, reject: mismatched parentheses), 

if so, update Err to this value, 

5: simplify y into uv , where u has only closing parentheses and v has only opening parentheses, 

6: for i — 1 to |u| do 

7: pop [m,m'] from the stack (reject if nothing to pop), 

8: sum -i— sum — a m if u[i] = ']' (unchanged otherwise) 

9: m' <- m! — 1, 

10: if m > to', push [to, to'], 

11: end for 

12: sum <— sum + > V4tt J , 

i — * J — '-open ■> 

(by abuse of notation, Vj = 1 if Vj ='[' and is otherwise) 

13: push [c open , C open + \v\], 

14: set C p en 4 C p en -\- \v j, 
15: end for 

16: Check sum = p(a) for any polynomial p with 0-1 coefficients of degree less than n and weight at most 
k. If not, REJECT. 



Reducing the randomness The ideas used in reducing randomness for Z\- fe (l-turn-Dyck 2 ) also work for 
reducing randomness in the membership testing of Z\- fe (Dyck 2 ). Here, instead of hashing the input positions, 
we hash the indices using the random hash functions. For computing the indices, we use the procedure 
described above. Instead of computing polynomials, we compute hashes and follow the steps as in Sections 
3.1, 3.2, and 3.3. 

We get the following theorem: 

Theorem 5. For every constant < e < 1, there is randomized one-pass algorithm that tests membership 
in Zi- fc (Dyck 2 ) using 0(k 1+e + \J n log n) space, O(logrt) randomness, 0(\og ^ n + k ^) time and errs 
with probability 1/8. 
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5 Lower bound 



We will show a lower bound for PHam n k,k+2 by reducing the augmented indexing problem IND^ (see [T5] ) 
to it. 

Let U U {_L} denote a large input domain, where U. Dehne the problem IND^ as follows: Alice and 
Bob receive inputs x — (x 1: x 2 , ■ ■ . , xn) G U N and y — (i/1,2/2, • • • , Vn) S (U U {-L})^, respectively. The 
inputs have the following promise: There is some unique i € [iV], such that y, G £L, and for k < i: Xk = yt, 
and for k > i: y^ =_L. The problem IND^ is defined only over such promise inputs and IND^(x, y) = 1 if and 
only if Xi = y*. 

In [T91 Corollary 3.1] they proved the following result: 

Theorem 6 Q19J). Any randomized one-way communication protocol that makes S = \/A\lA\ error requires 
f2(N log 1/5) bits of communication. 

We use this result and prove a lower bound for PHam„,fc,fc + 2. 



Let \U\ = n/k. Let Ja ■ U — > {0, 1} 3 ™/' C ; and f B : (U U {_!_}) — > {0, be encoding functions defined as 



f B (±) =o 3 ™/ fc . 

On promise inputs x,y G let Fa(2;) and Fs(y) be defined as fA(xi)fA(x2)---fA(xk) and 

/b(2/i)/b(2/2) • ■ • respectively. 

Under this encoding, it is easy to see that PHam 3nj 2fe i 2fc+2(-FA(2 ; ) 5 -FB(2/)) = 1 if and only if IND^(x, y) = 1. 
Suppose i + 1 is the first position where _L appears in y. For each j < i we have = yj so the Hamming 
distance of Fa(Xj) and Fsii/j) is 2. Also for every position j > i we have yj = _L and hence Fsiyj) = 3ra//fe , 
which results in a Hamming distance of 2 between ^(2^) and Fb(xj). So the Hamming distance between 
Fa{x) and Fsiy) is 2(/s — 1) plus the Hamming distance between /^(xj) and /s(yi), which is 2 iff Xj = y; 
and 4 iff a;, 7^ y, (since x^y^ eW). 

Therefore we get the following lower bound: 

Theorem 7 (Theorem [TJ restated). ^4ny randomized one-way protocol that makes at most k/n error 
and computes PHam3 nj 2fe,2fc+2 7 requires f2(k\og(n/k)) bits. In fact the lower bound holds for distinguishing 
between the case A(x,y) = 2k and A(x,y) = 2k + 2. 

By Theorem [2] this bound is optimal when n > k 2 (and in fact when n > k 1+e , for constant e > 0). 
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